Expanding the Scope of the ATIS Task: The ATIS-3 Corpus

نویسندگان

  • Deborah A. Dahl
  • Madeleine Bates
  • Michael Brown
  • William M. Fisher
  • Kate Hunicke-Smith
  • David S. Pallett
  • Christine Pao
  • Alexander I. Rudnicky
  • Elizabeth Shriberg
چکیده

The Air Travel Information System (ATIS) domain serves as the common evaluation task for ARPA"spoken language system developers. 1 To support this task, the Multi-Site ATIS Data COllection Working group (MADCOW) coordinates data collection activities. This paper describes recent MADCOW activities. In particular, this paper describes the migration of the ATIS task to a richer relational database and development corpus (ATIS-3) and describes the ATIS-3 corpus. The expanded database, which includes information on 46 US and Canadian cities and 23,457 flights, was released in the fall of 1992, and data collection for the ATIS-3 corpus began shortly thereafter. The ATIS-3 corpus now consists of a total of 8297 released training utterances and 3211 utterances reserved for testing, collected at BBN, CMU, MIT, NIST and SRI. 2906 of the training utterances have been annotated with the correct information from the database. This paper describes the ATIS-3 corpus in detail, including breakdowns of data by type (e.g. context-independent, context-dependent, and unevaluable)and variations in the data collected at different sites. This paper also includes a description of the ATIS-3 database. Finally, we discuss future data collection and evaluation plans.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Session 7: Demonstrations And Videos

1. Robus t Speech Unders tanding in the ATIS Domain Carnegie Mellon University 2. Real-Time ATIS System BBN Systems & Technologies 3. Software ATIS System SRI International 4. Interactive Booking in the ATIS Domain MIT Laboratory for Computer Science 5. Real-Time Speech Recognition For Controllers BBN Systems & Technologies 6. Speech Research and Development at Apple Apple Computer 7. Expanding...

متن کامل

NIST-ARPA Interagency Agreement: Human Language Technology Program

PROJECT GOALS 1. To coordinate the design, development and distribution of speech and natural language corpora for the ARPA Spoken Language research community, and the use of these corpora for technology development and evaluation. 2. To design, coordinate the implementation of, and analyze the results of performance assessment benchmark tests for ARPA's speech recognition and spoken language u...

متن کامل

DARPA February 1992 ATIS Benchmark Test Results

This paper documents the third in a series of Benchmark Tests for the DARPA Air Travel Information System (ATIS) common task domain. The first results in this series were reported at the June 1990 Speech and Natural Language Workshop [1], and the second at the February 1991 Speech and Natural Language Workshop [2]. The February 1992 Benchmark Tests include: (1) ATIS domain spontaneous speech re...

متن کامل

Corpus Collection for ATIS

The project goal is to collect and deliver a corpus of speech data that supports DARPA SL~ system development. As of February 1991, SRI has set up a hardware and software environment for the collection of spoken interactions with a simulated Air Travel Information System (ATIS), established a data collection procedure, collected and dis~buted prototype data, and evaluated the prototype data wit...

متن کامل

Benchmark Tests For The Darpa Spoken Language Program

This paper documents benchmark tests implemented within the DARPA Spoken Language Program during the period November, 1992 January, 1993. Tests were conducted using the Wall Street Journal-based Continuous Speech Recognition (WSJ-CSR) corpus and the Air Travel Information System (ATIS) corpus collected by the Multi-site ATIS Data COllection Working (MADCOW) Group. The WSJ-CSR tests consist of t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994